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Abstract 

ri/n 



I't'I Wc consider the models Yi^n = Jq '^is)dWs + T{i/n)ei,„, and Yi^n = <^i'i/n)Wi/n + 

T{i/n)ei^nj i = where {Wt)t^\fi\\ denotes a standard Brownian motion and 

^ . Ci^n are centered i.i.d. random variables with E (ef^) = 1 and finite fourth mo- 

ment. Furthermore, a and r are unknown deterministic functions and (Wt)jg[g ^ and 

^ (ei,n) ■ ■ • ) En.n) are assumed to be independent processes. Based on a spectral decom- 

position of the covariancc structures we derive series estimators for cr^ and and 
investigate their rate of convergence of the MISE in dependence of their smoothness. 

^f) To this end specific basis functions and their corresponding Sobolev ellipsoids are in- 

troduced and we show that our estimators are optimal in minimax sense. Our work 

^>f-^ is motivated by microstructure noise models. A major finding is that the microstruc- 

QQ ture noise e^.n introduces an additionally degree of ill-poscdncss of 1/2; irrespectively 

of the tail behavior of e,^„. The performance of the estimates is illustrated by a small 
numerical study. 
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1 Introduction 

Consider the models 



i/n 



and 



Yi,„= 1^ a{s)dWs + T \^-] Ci^n i = l,...,n, (1.1) 



Yi,n = a[^jW,/^ + TU]ei,n i = l,...,n (1.2) 
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respectively, where (W^t)tg[oi] denotes a Brownian motion and ej^„ is so called microstruc- 
ture noise, i.e. we assume ei^n i-i-d., E ^e^„^ = 1 and E ^e^„^ < oo. (^t)tg[o i] ^^"^ 
(ei^ri) - - - J en,n) are assumed to be independent, and a and r are unknown, positive and 
deterministic functions. 



Our models (1.1) and (1.2) are natural extensions of the situation when a and r are 
constant, which has been, in a slightly broader setting, previously considered by [8], [13], 
[14] and [23] among others. In the latter papers sharp minimax estimators were derived for 
cr^ and r^. The minimax rate for is and for it is n~^/^, and the corresponding 

constants for quadratic loss (MSE) being 8ro"^ and 2r^, respectively. To estimate a and 
T, maximum likelihood is feasible (see [23]) and achieves these bounds. Other efficient 
estimators where given by [8], |13j or [TJj. In our case, i.e. when a and r are functions 
these methods fail and techniques from nonparametric regression become necessary. We 
will postpone a more careful dicussion of models (1.1) and (1.2) to Section [2] 

Both models incorporate, as usually in high-frequency financial models, an additional 
noise term, denoted as microstructure noise (cf. [T] and [16] ) in order to model market 
frictions such as bid-ask spreads and rounding errors. In general, microstructure noise is 
often assumed as white noise process with bounded fourth moment. Therefore, we may 
interpret both models as obtaining data from transformed Brownian motions under addi- 
tional measurement errors. Particularly, our assumptions cover the important case when 
ei,n '~-AA(0,l). 



In this paper we try to understand how estimation of the functions a and r in (1.1) 
and (1.2) itself can be performed, i.e. the time derivative of the integrated volatility. To 
our knowledge, this issue has never been addressed before, a remarkable exception is [3] 
where a harmonic analysis technique is introduced in order to recover o"^ from noiseless 
data. A naive estimator of o"^ would be the derivative of an estimator of a'^{x)dx with 
respect to s. However, (numerical) differentiation of a'^{x)dx with respect to s yields 
an additional degree of ill-posedness and there are to the best of our knowledge no esti- 
mates and no theoretical results available how to estimate a'^ in our situation. Instead, we 
propose a regularized estimator for a and r that attains the minimax rate of convergence. 
Our estimator is a Fourier series estimator where we will estimate the single cosine Fourier 
coefficients, (T^(x) cos{kTTx)dx, A; = 0, 1, ... by a particular spectral estimator which is 
specifically tailor suited to this problem. The difficulty to estimate o"^ can be explained 
generically from the point of view of statistical inverse problem: Microstructure noise in- 
duces an additional degree of ill posedness -similar as in a deconvolution problem- which 
in our case leads to a reduction of the rate of convergence by a factor 1/2. Surprisingly, 
and in contrast to deconvolution, this is only reflected in the behavior of the eigenvalues of 
the covariance operator of the process in (1.1) and (1.2) and not in the tail behavior of the 
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Fourier transform of the error ej^„. 

We stress again that we are aware of the fact that our model assumes a deterministic 
function a and r, which only depends on time t and generalization to a (t, Xt) is not obvious 
and a challenge for further research. However, the purely deterministic case already helps 
us to reveal the daily pattern of the volatility and finally we believe that our analysis is an 
important step into the understanding of these models from the view point of a statistical 
inverse problems. 

Results: All results are obtained with respect to MISE-risk. Let a and (3 denote a certain 
smoothness of cr^ and r^, respectively. Roughly speaking, these numbers correspond to the 
usual Sobolev indices, although in our situation, a particular choice of basis is required, 
leading us to the definition of Sobolev s-ellipsoids (see Definition [l]) . Then we show that 
can be estimated at rate n"^/^^^"'"^) for /3 > l,a > 1/2 in model (1.1) and /3 > l,a > 3/4 in 
model ( 1.2 ). This corresponds to the classical minimax rates for the usual Sobolev ellipsoids 
without the Brownian motion term in (1.1) and (1.2). More interesting, we obtain for 
estimation of o"^ the n""/^^""''^) rate of convergence for a > 3/4, /3 > 5/4 in model (1.1) 
and a > 3/2, /3 > 5/4 in model (1.2). We will show that these rates are uniform for Sobolev 
s-ellipsoids. Lower bounds with respect to Holder classes for estimation of cr^ have been 
obtained in [17]. Here we will extend this result to Sobolev s-ellipsoids. It follows that the 
obtained rates are minimax, indeed. 

To summarize, our major finding is that in contrast to ordinary deconvolution the diffi- 
culty of estimation cr^ when corrupted by additional (microstructure) noise e, is generically 
increased by a factor of 1/2 within the s-ellipsoids. This is quite surprising because one 
might have expected that for instance Gaussian error leads to logarithmic convergence rates 
due to its exponential decay of the Fourier transform (see e.g. jl], [6], [7] and [11] for some 
results in this direction). We stress that for our method a minimal smoothness of a in 



(1.1) of a > 1/2 and in (1.2) of a > 3/2 is required. Although convergence rates are half 
compared with usual nonparametric regression, it turns out that for large sample sizes we 
get reasonable estimates for smooth functions cr^. Roughly speaking, the results imply that 
n data points for estimation of can be compared to the situation, when we have ^/n 
observation in usual nonparamteric regression. 

The work is organized as follows. In Sections [2] and [s] we will discuss models (1.1) and 

(1.2) in more detail, introduce notation and define the required smoothness classes, Sobolev 



s-ellipsoids (details can be found in AppendixjBj). Section 4.1 and Section 4.2 are devoted to 
estimate o"^ and r^, respectively, and to present the rates of convergence of the estimators 
(for a proof see Appendix |A]). Section [5] provides the minimax result. In Section [g] we 
briefly discuss some numerical results and illustrate the robustness of the estimator against 
non- normality and violations of the required smoothness assumptions for and r^. Some 
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further results and technicalities of Sections 4.1 and 4.2 are given in the supplementary 
material. 



2 Discussion of Models ([TT]) and 



In this subsection we briefly discuss the background from financial economics of model 



(1.1) and explore the differences between models (1.1) and (1.2). We may consider the 



processes (cr (t) Wt)ie[o,i] and (^f^ a (s) dWsj ^^^^ = {W {H {t))^^^^^^, H (t) := /g cr^ (s) ds 
as (inhomogeneously) scaled Brownian motions, where scaling takes place in space and in 
time, respectively. Hence we will refer to (c (i) W^t)tg[o i] (^Jo i^) dWg^ in the 
future as space-transformed (sBM) and time-transformed (tBM) Brownian motion. 



Model ( |1.1[ ): In the financial econometrics literature variations of model (1.1) are 



often denoted as high-frequency models, since iWt)^^^Q ^ is sampled on time points t = i/n 
and nowadays there is a vast amount of literature on volatility estimation in high-frequency 
models with additional microstructure noise term (see [2], [15], [26] and [2Z])- These kinds 
of models have attained a lot of attention recently, since the usual quadratic variation 
techniques for estimation of Jq a'^{x)dx lead to inconsistent estimators (cf. [2S])- 

We are aware of the fact, that in contrast to our model, volatility is modelled gen- 
erally not only as time dependent but also depending on the process itself, i.e. Yi^n = 
^ijn + T (i/'n) ei^n, i = l,...,n, dXt = a {t, Xt) dWt- An overview over commonly used 
parametric forms a (t, Xt) and a non-parametric treatment in the absence of microstruc- 
ture noise, can be found in [12]. It is known that the same rates as for the case a and 
r constant hold true if we consider the model ( [1.1[ ) and estimate the so called integrated 
volatility or realized volatility a'^{x)dx (s G [0, 1]) and T'^{x)dx instead of cr^ and r^, 
respectively (see [20] and [22j for a discussion on estimation of integrated volatility and 
related quantities). Recently, model ( |1.1| ) has been proven to be asymptotically equivalent 
to a Gaussian shift experiment (see [21J). o"^ as a function of time corresponds in model 



(1.1) to the instantaneous volatility or spot volatility. 



Model (1.2): Model (1.2) can be regarded as a nonparametric extension of the model 
with constant a, r as discussed for variogram estimation by |24j . To motivate the usefulness 
of sBM we give the following Lemma. 

Lemma 1. (i) Assume that a, < c < a, is continuously differentiahle. Then the 
corresponding sBM, {a (t) Wt)t^^Q ^ is the unique solution of the SDE 

dXt = Xtd (log {a (i))) + a (t) dWt, Xo = 0, 0<t<T. 
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(ii) The variogram of sBM is given by 



7(s,t) ■.= E{Xt-X, 



(a (t) _ ^ (s) ^1/2^ \a{t)a {s) 



s-t 



a/2 _ ^1/2 



Proof, (i) It is easy to check that sBM indeed is a solution. To estabhsh uniqueness, we 
apply Theorem 9.1 in [23]. (ii) This follows by straightforward calculations. □ 



Comparison of the models: We remark that tBM can be related to sBM by partial 
integration a (s) dWs = a (t) Wt - £ a' (s) Wsds. To see the differences we compared in 
Figure [l]sBM and tBM in two typical situations: The case where cr (t) = for t > T and 
the case, where a is non-continuous. If cr (t) = for t > T, sBM tends to zero, whereas 
tBM tends to a constant, i.e. the random variable a (s) dWg- Furthermore, if a is a 
jump function, sBM has a jump too, whereas tBM does not. 



Unlike Model (1.1), which can be viewed as a price process. Model (1.2) has no di 



rect application in financial mathematics. However, from the view point of nonparametric 
statistics it seems to be a natural extension of the situation when a and r are constant. 



3 Introduction to Sobolev s-Ellipsoids and Technical Prelim- 
inaries 

In this section we shortly introduce the setup needed in order to define the estimators. 
First we define suitable smoothness classes, which are different, but related to well known 



Sobolev ellipsoids (see Definition B.l). 

Definition 1. For a > 0, C > 0, we call the function space 
G, := G,(a,C7) := 



/ G l2[0, 1] : 3 {9n)„^n > t. fix) = + 2 ^ cos {inx) , ^ i^'^Of < C 



i=l i=l 



a Sobolev s-ellipsoid. If there is a C < oo such that / G G^ (a, C), we say f has smoothness 
a. For < I < u < oo, we further introduce the uniformly bounded Sobolev s-ellipsoid 

G^(a, C) := G^(a, C, [/, n]) := {/ G Q,(a, C) : I < f < u} . 

Here the "s" refers to symmetry" since the [0, 1] basis 

{^fc, fc = 0, . . .} := |l, \/2cos {kTTt) , /c = 1, . . .| , (3.1) 
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can also be viewed as a basis of the symmetric L^[— 1, 1] functions 

{/ : / e L2 [-1, 1] , fix) = fi-x) Vx G [0, 1]} . 
Usually, Sobolev ellipsoids are introduced with respect to the Fourier basis 

1, V2sm {2kiTt) , \/2cos (2/c7rt) , A: = 1, . . .| 



on L2 [0, 1] (see Definition (B.l)). As will turn out later on, Sobolev s-ellipsoids are more 
natural for our approach. If a function has a certain smoothness in one space, it might 
have a completely different smoothness with respect to the other basis. For instance the 
function cos ((2/ + 1) nx), I G N has smoothness a for all a < 00 with respect to basis (3.1), 
and as can be seen by direct calculations only smoothness a < 1/2 for the Fourier basis. A 
more precise discussion can be found in Part |B] of the Appendix. 

Instead of (3.1) it is convenient to introduce the functions fk : [0, 1] — )• M, G N 

fk{x) := V-fc ^ 



Note that for A; > 1, /| can be expanded in basis (3.1) by /| = V'o + 2^"'^/^Vfc- For any 
function g we introduce the forward difference operator Aig := g {{i + 1) /n) — g{i/n) 
and further the transformed variables AY^^ := (Yi^i^n — Yi^n) fj. (i/n) and AY^^^ : = 
^i+i,n — ^i,n) fk (V"")) « = 1 , • • • , w — 1 for models (1.1) and (1.2), respectively. In order 

/ = 1,2. Throughout 



to discuss the models simultaneously, we will write AYf^ = AY- 
the paper we abbreviate first order differences of observations by 



We write 



AY' := (AY,%...,AYti,n 



Ip and Dp for the space of p x g matrices, p x p matrices and p x p 



diagonal matrices over M, respectively. Further let -Dn-i G 
Y^2/n sin (zjvr / n) and define 



A 



i,n— 1 



4sin^ (ivr/ (2n)) i = l,...,n-l 



-1 given by {Dn-i)ij = 
(3.2) 



the eigenvalues of the covariance matrix Kn-i G M„_i of the MA(1) process Ajej_^ 



e,; , 



1, . . . , n — 1. More explicitly Kn-i is tridiagonal and 



n~l, 



2 for i = j 
— 1 for \i — j\ 
else 



(3.3) 



Note that we can diagonalize Kn-i explicitly by Kn~i = -Dn- 1 An- where A„_i is 



diagonal with diagonal entries given by (3.2). 
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We will suppress the index n — 1 and write D, A, Aj instead of Kn-i, Dn-i, A„_i, 
and Ai,n, respectively. We write [x\ := max^g^jz < j;}, x G M, the integer part of x. log() 
is defined to be the binary logarithm and in order to define estimators properly, we assume 
throughout the paper additionally n > 16. 



4 Estimators and Rates of Convergence 
4.1 Estimation of 

Before we will turn to the estimation of the volatility o"^, we will first discuss estimation of 
the noise variance, i.e. r^. Let G On-i given by 



(n — n/logn) ^ for [n/ logn] < i, j < n — 1 

otherwise 



where Aj is defined by (3.2) and 6ij denotes the Kronecker delta. We consider models (1.1) 

(4.1) 



and (1.2), simultaneously. Let 



4,0 := AyM djidHay'' 



In Lemma 



C.l 



it will be shown that t^fl is a -y/n— consistent estimator of 

1 







'T {x)fk {x) dx. 



Note that for k > 1 this means i^^o = Jq 'T''^{x)il^Q{x)dx + 2 "^/"^ T'^{x)'4)k{x)dx. Define 
Z := D i/SY^\ and denote by Zi the i-th component of Z. Then 



[n — n 



/logn)-^ K^Zl 

i=[n/ log n] 



(4.2) 



Hence this also can be seen as a spectral filter in Fourier domain, where we cut off the 
first n/logn frequencies. Note that for i > 1, 2^/^ (tj^o ~ ^o,o) = Jq T'^{x)'4'i{x)dx is the 
i-th series coefficient with respect to basis (3.1 ). This observation suggests to construct the 
cosine series estimator 

N 

i{t) := to,o + ^Y1 ~ '^^^ • (^•^) 



i=l 



The next result provides the rate of convergence of uniformly within Sobolev s- 
ellipsoids. To this end a version of the continuous Sobolev embedding theorem is required 



for non-integer indices a,/3 (see Lemma D.8). A proof of the following Theorem can be 
found in the supplementary material. 
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Theorem 1 (MISE of f^{t)). Let f^{t) as defined in (4-3). Assume /3 > 1, and Q,Q > 0. 
Further suppose that N = Nn = o {v}^"^ / \ognj . Assume either model (1.1) and a > 1/2 or 
model (1.2) and a > 3/4. Then it holds 

sup MISE (f^) = O (n-^^ + Nn-^) . 

Minimizing the r.h.s. yields N* = O (n^/C^/^^^)) and consequently 



sup MISE ('r^,)=0 

(T2ee^(a,Q),r2ee^(/3,Q) 



n 



-2/3/(2/3+1) 



Remark 1. Note that for model (l.!]) Theorem^ holds, whenever a > 1/2. Hence the 
Brownian motion part of the model can he viewed as a nuisance parameter, not affecting 
rates for estimation of t"^ . However, for model (1.2) a > 3/4 is required here. This more 
restrictive assumption is essentially a consequence of the fact that the process a (i/n) Wij^ 
is in general no martingale. 

Remark 2. The result from Theorem^ can be extended to 1/2 < /3 < 1 m model (1.1) and 



to 1/2 < a < 3/4, 1/2 < /3 < 1 in model 1^1^. Let t^fi he defined as ikfl in (4-1) but is 
now replaced by G ©n-i. 



Or 



2n-^Xr^8ij, for [n/2] <i,j <n-l 
otherwise 



Introduce further the estimator (t) = to,o + 2 YliLi (^«,o — *o,o) cos (i-Kt) . Further suppose 
that N = O (^n^/('^(^+^)'^ . Then we obtain by slight modifications of the proof of Theorem^ 
for 13 > 1/2, a > 1/2 and Q,Q>0 



(i) Assume model (1.1). Then it holds 

sup MISE (f|r) = O (n-^^ + Nn-^ + Nn^-^^) 

(72ee^(a,Q),T2ee^(/3,Q) 

and N* = (n(2/3-i)/(2/3+i)) yields 

sup MISE (4.) = O (^-(4/3^-2/3)/(2/3+l)\ . 

<72ee^(a,Q),T2ee^(/3,Q) 



(a) Assume model (1.2). Then we have the expansion 

sup MISE (f^) = O (n-^I^ + Nn-^ + iVn^^^^ + iVn^"^"' 

(72ee^(Q,Q),r2ee^(/3,Q) ^ ^ 
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and the choice 



O (n(2/3-i)/(2/3+i)) for /? < 1 A (2a - 1 /2) , 
O (n(4"-2)/{2/3+i)) for a<3/4A(/3/2 + l/4) 



yields 



sup MISE(f^,) 

a2ee^(a,Q),r2ee^(/3,Q) 



O (n-(4'^'-2/3)/(2/3+i)^ /or /? < 1 A (2a - 1/2) , 
O (n-(2-2a)/(2/3+i)^ /or a < 3/4A(/3/2 + l/4). 



Remark 3. is also possible, although more technical, to compute the asymptotic constant 
of the estimator f^, . Suppose that the micro structure noise is Gaussian and assume model 



(1.1) and 13 > 1 or (1.2) and (3 > l,a > 3/4, then we have more explicitly 

2N* °° / r-l \ 2 

MISE(fL) = / T^{x)dx+ y / T'^{x)il)k{x)dx\ +oiN*n-^). 

n Jo ,=V+A^ / 

Remark 4. There are of course simpler estimators for tkfl. For instance if we replace 



in (4.1) by (2^)^"*^ I^-i, where G ©n-i denotes the identity matrix, we obtain 

the quadratic variation estimator for tkfl (cf. /ij/J and it is not difficult to show that this 
estimator attains the optimal rate of convergence. This approach could even be extended 



to a nonparametric estimator of the form (4-.3). However, the single Fourier coefficients 



are not estimated efficiently, since in the case when the micro structure noise is Gaussian 
the asymptotic constant is 3n^^ J T^{x)dx (this is a straightforward extension of Theorem 



A.l in [271 ) whereas for our estimator we have 2n ^ J T^{x)dx (see Lemma G.l). If t 



is constant it can be easily seen that estimators in (4-1) are efficient for k = whereas 
quadratic variation is not. 

Remark 5. In practical application it would be more natural to use instead of n/\ogn in 



(4-2) other cut-off frequencies e.g. n'^/logn or qn, where 1/2 <7<1, 0<(7<1. Smaller 



7 decreases the variance while on the other hand increases the bias of the estimator. 

4.2 Estimation of 

Define J„ G On-i by 

{j^5i i, for rnV21 + l< i, j < 2 M/^] 
otherwise 
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Similar, as for the estimation of we first introduce an estimator of appropriate Fourier 
coefficients by 



Skfi 



AyM DJnD' ( AyM - 77rHk,o/3. 



(4.5) 



Tlie second part, i.e. —77r'^tkfl/3 is a bias correcting term, where the constant Tvr^/S is due 
to the choice of cut-off points [n^/^] + 1 and 2 [n^/^] in (4.4). As we will see, the estimator 
of tkfi has better convergence properties than the first term in s^^, and hence does not 
affect the asymptotic variance. Similar to (4.3), we put 



a%{t) = so,o + 2 ^ (si,o - sofi) cos {int) 



(4.6) 



Theorem 2 (MISE of a^). Let a]^ as defined in {'4-6). Suppose that N = Nn = o (n^/^), 
P > 5/4 and Q,Q > 0. Assume model (1.1) and a > 3/4 or model (1.2) and a > 3/2. 
Then it holds 

sup MISE = O (iV~2a ^ ^„-i/2\ 

^2eefc(a,Q),r2ee^(/3,Q) 

and minimizing the r.h.s. yields 

sup MISE {a%,) = O 

a2ee^(a,Q),r2ee^(/3,Q) 

for iV* = 0(ni/(4°+2)^_ 

The proof of Theorem [2] is given in Section A. 2 



n 



-a/(2a+l) 



Remark 6. It is also possible to extend this result for less smooth functions and r^. 



(i) Assume model (1.1) and a > 1/2, /? > 1. Then it holds 

sup MISE {a%) 



and 



N* 



O (n(2"-i)/(2"+i)) for a < 3/4 A {P- 1/2), 
O (n(2/3-2)/(2a+i)) /or /3< 5/4 A (a + 1/2) 



yields 



sup MISE(a^*) 

o-2ee^(Q:,Q),T2ee6(/3,Q) 



O (n-2°(2°-i)/{2a+i)) /or a < 3/4 A (/3- 1/2), 
O (n-2"(2/3-2)/(2a+i)j /or /3<5/4A(q + 1/2). 
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(ii) Assume model (1.2) and a > 3/2, /? > 1. Then it holds 

sup MISE {&%) = O (iV'^a + + iVn2-2/5) , 



and iV* = O (n(2/3-2)/(2a+i)^ ^^g/^^ 



sup MISE(ct^.)=0 



\ fn-MW-2)/{2a+l)\ 



a2ee^{Q:,Q),r26e6(/3,Q) 



Remark 7. /n analogy to {4-2), the estimator Sk.o can also be viewed as a spectral filter 
in Fourier domain, where essentially only the frequencies n^/^, . . . , 2n^/^ play a role. For 
practical purposes one can generalize this to estimators where the frequencies k, . . . , [cn^/^] , 
c > are used. If a is assumed to be very smooth, one even may set k = 1. In this more 
general setting, the constant — 77r^/3 in the definition of the estimator has to be replaced by 

UV21 



-n 



/{[cny-]-k)EZ'~^X.. 



Remark 8. Since the matrix D in the definition of s^fi is a discrete sine transform (for a 
definition see ISj/) the estimator a'j^ can be calculated explicitly taking O (Nnlogn) steps. 



5 Minimax 

In this section we will discuss the optimality of the proposed estimators. To this end we 
establish lower bounds with respect to Sobolev s-ellipsoids. 



Theorem 3. Assume model (i. j| j or model (1.2), a G N\{0}. Further assume r constant. 



Then there exists a C > (depending only on a,QJ,u), such that 
lim inf sup 



?i-s>oo <5-2 o-2ge^(a,Q) 
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Proof. The proof relies on a multiple hypothesis testing argument and is close to the proof 
given in [T7], Theorem 2.1. However, the lower bounds there are established with respect 
to the space of Holder continuous functions of index a on the interval [0, 1] , i.e. ioi I < u 

(a, L) := C'' {a, L, [I, u]) := |/ : f^P^ exists for p = [a] , 

f^P\x) - f^P\y) <L\x-y\''-P, \fx,y e I, < / < / < u < oo} . 

Therefore, the statement above does not follow immediately from |[17j. Theorem 2.1 because 
of (a, L) ^ (a, Q) due to boundary effects. Here, we will only point out the difference 
to the proof of [T7], Theorem 2.1. We write cTmmi fmax for the lower and upper bound 
of (T^, respectively, i.e. o"^ G Sg (a, Q, [o"min, Cmax])- Without loss of generality, we may 
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assume that cr^in = 1- For the multiple hypothesis testing argument (cf. [25]) a specific 
choice of functions af^ is required. For a construction see [T^, proof of Theorem 2.1 where 
L := (27r2"Q)^/^ / . It remains to show 

4„ee^(a,Q), i = 0,l,...,M. 

Due to the construction of af ^, we have cr2„(t) = 1 for t G [0, 1/4] U [3/4, 1] and cr,^ j'^ (0) = 
cjjj') (1) = for / G {0, 1, ...,«} . Thus, af^ G W (a,^^ ) (for a definition see 



Equation (B.l )), a G {1, 2, 3, . . .}, j = 0, . . . , M. Hence by Theorem B.l it follows af^ G 



e, (a,Q) for i = 0,...,M. □ 

6 Simulations 

In this section we briefly illustrate the performance of our estimators. Our aim is not to give 
a comprehensive simulation study, rather we would like to illustrate the behaviour of the 
estimator when assumptions of Theorems [l] and [2] are violated. In the following we plotted 
our estimator to simulated data, where we always set n = 25.000. From the point of view 
of financial statistics this is approximately the sample size obtained over a trading day (6.5 



hours) if log-returns are sampled at every second. For simplicity, we will choose in (4.3) 



and (4.6) as the minimizer of ||f2 - r^ll and ^ , respectively, which is in practice 

unknown. Of course, proper selection of the threshold N* is of major importance for the 
performance of the estimator. To this end various methods are available, among others, 
cross validation techniques, balancing principles, and variants thereof could be employed 
(see e.g. [S], [ID], [IE] and [H])- A thorough investigation is postponed to a separate paper. 
Throughout our simulations we assumed r = 0.01 and concentrated mainly on estimation 
of o"^, as it is the more challenging task. 



In Figure 
by Definition 



2 we have displayed the estimator for a{t) = (2 + cos (27rt))"'^^^. Note that 
1, has "infinite" smoothness, i.e. for any a > 0, we can find a Q < oo, 
such that (T^ G @s {o, Q) ■ The reconstruction shows that estimation of can be done 
much easier than estimation of cr^ although it is of smaller magnitude. In Figure [sj we are 
interested in the behavior of the estimators if heavy-tailed microstructure noise is present. 
This was simulated by generating Ci^n ~ 3~^/^t (3), i = 1, . . . ,n, i.i.d., where t (3) denotes 
a t-distribution with 3 degrees of freedom. We can see from Plot 1 in Figure [3] that 
the resulting microstructure noise has some severe outliers according to the tail of 
the density of t(3). Nevertheless, estimation of and o"^ is not visibly affected by the 
distribution of the noise. 

In the subsequent figures we illustrate the behaviour of the estimator when the required 
smoothness assumptions on cr^ and are violated. To this end, we investigate in Fig- 
ure |4] the situation when a is random itself, i.e. a realization from a Brownian motion. 
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a it) 



Wt 



The Brownian motion f Wt I was modelled as independent from the 

^ /te[o,i] 



Brownian motion in (1.1) and the microstructure noise process. It is of course not possible 
to reconstruct the complete path of cr^, but as Figure |4] indicates, the estimators at least 
detects the smoothed shape of the path and so our estimator might already reveal some 
parts of the pattern of volatility also in case a is non-deterministic, which is certainly more 
realistic in most applications. 

Finally, in Figure [5] we investigated the case of a being a jump-function. We put 
a {t) = 1 + 1(1/2,1] (0 ) a function with jump at t = 1/2. Fourier series usually show a 
Gibbs phenomenon, i.e. an oscillating behavior at discontinuities. This behavior is also 
clearly visible in the graph of a^. In order to reconstruct jumps in volatility other methods 
certainly will be more suitable and are postponed to a separate paper. 

Computational tasks: We implemented the estimators in Matlab using the routine 
fft() for the discrete sine transform (see Remark [s]). Calculation of the estimators for a 
sample size of n = 25.000 took around 2-3 seconds on a Intel Celeron 1.7 GHz processor. 
As mentioned in Remark [sj the estimator can be calculated in O {Nn log n) steps. If 
we choose with the optimal scale, i.e. N have for the complexity 

O (An log n) = o (n^/^logn) , whenever a > 1/2. 

Appendix A Convergence rate of (7^ 

In this section we will give a proof of Theorem [2} To this end we first introduce some 
notation and then prove a Lemma in order to get uniform estimates of bias and variance 
of the single estimators Skfl- 

A.l Preliminary Results and Notation 

Proofs of the upper bounds are based on a decomposition of AY^. In this subsection we 
present some further notation. Let ak{t) := a{t)fk{t) and Tk{t) := T{t)fk{t), t G [0, 1]. Let 
throughout the following for the Sobolev s-ellipsoids in Definition [l] for cj^ the constants 
being I = cjmm and u = cJmax and for t'^, I = Tmm, u = Tmax- We define 



sup max sup 



sup max max lAjT^I . (A.l) 

2ee5(;3,Q)fc<ni/4 i=l,...,n 



In order to do the proofs for model (1.1) and model (1.2) simultaneously, we first define 
the more general process 

Vfc,; := Xi^k + A2,A,. + Zi^k,i + ^2,fc, ^ = 1,2, (A. 2) 
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where Xi^k, ^2,k, Zi,k,l and Z2^k are n — 1 dimensional random vectors with components 



iXi,k), 


'■= <^k 


{i/n) AiWi/n, 


i^2,k)i 


■= Tk 


[i/n) Aiei^n, 


{Zi,k,i)i 


■= fk 


(i/n) / a ( 

Ji/n \ 


iZl,k,2)i 


■= fk 


(i/n) (Aid) 


{Z2,k)i 


■= fk 


(i/n) (Ajr) ei+i,„, 



dW, 



Obviously, AY^ = V^^i and ISY^ = Vk^2 if model (1.1) and (1.2) holds, respectively. Define 
the generalized estimators 4,o,/ := ^k,i^'^nD^^k,i and Sk,o,i ■= V^iDJnD^Vk^i - 77r^4,o,«/3. 
Further there exists a decomposition with Ci^k,i, C2^k £ M„_.i^„ such that 



Vfc,/ = Ci^k,ii + C'2,fce, 



(A.3) 



where e = (ei_„, . . . ,en,nf and ^ = is standard n-variate normal, e, ^ independent and 

Ci,k,ii = Xi^k + Zi^k,h C'2,fce = X2^k + ^2,fe- Now, let 



Sk,p ■- 



1 r-l 

al{x) cos{pTTx)dx, tk^p := / r|(a;) cos(p7ra;)(ix 
' Jo 



(A.4) 



be the scaled p-th Fourier coefficients of the cosine series of cr^ and r| , respectively. Define 
the sums A(al,r) by 

Sk,o + 2 Em=i Sk,2nm for r = mod 2n, 
^ ('^L "r) = S I]m=o ■5fc,2nm+n for r = n mod 2n, 

^ Eg=±r mod 2n, 5>o «fc,<? for r ^ mod n, 

and analogously yl(r|,r) with Sk,p replaced by Some properties of these variables are 
given in Lemma |D.1| and Lemma D.2 



Further define 



/ afc(l/n) 



V 



(A.5) 



fTfc(l-l/n) J 

We put Cum4 (e) := Cum4 {ei^n) for the fourth cumulant of ei^„. If X,Y are independent 
random vectors, we write X J-Y. 

A. 2 Proofs for Estimation of cr^ 



Lemma A.l. Let Skfi be defined as in {4-5). Further assume (5 > 1, Q,Q > 0, < cimm ^ 



0"max < OO, < Tmin < Tmax < OO and fc = fcn G N. 
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(i) Assume model (l-l), a > 1/2. Then it holds 



sup \E{skfl)- Skfl\=0[n-^l^ + n^'^ + n^/^-n , (A.6) 

<T2ee^(«,Q), r2ee^(/3,Q) ^<"''"' ' ' V / 

sup max Var {hfi) = O in'^l'^ + n^"^^) . (A.7) 



(a) Assume model (1.2), a > 5/4. Then it holds 



sup max |E(sfc,o)-Sfc,ol =0(ni"'' + n^/2-2a^^-i/4\ ^^ g^ 

a2ee6(Q,Q), T2ee6(/3,Q)'==^"''''' ' ' V / 

sup max Var {hfl) = O in-^'^ + n^"^^) . (A.9) 

(72ee6(a,Q), T2ee6(/3,Q) ' ^ ^ 

Proof. The proof mainly uses the generalized estimators as introduced in Section |A.1[ It is 
clear that for two centered random vectors P and Q 

{P,Q)^ ■.= E{P'DJnDQ) 



defines a semi-inner product and by Lemma |D.5 P _L Q =^ {P, Q)^ = 0. Hence 



E Sk,0,l = {^l,k,Xi^k)^ + {X2,k,X2,k)^ + {Zl^k,l, Zl,k,l)^ + {Z2,k, Z2,k) „ 

+2 Z^^k,i), + 2 (X2,fc, Z2,k), - ^ E (4,0,0 • (A-10) 



Clearly with (iii) in Lemma D.l and rn := n [n^^^] 



= - tr (SfcL>J„DSfe) = -tr( J„Z)S|D) 
n n 



n J] {A{al0)-A{al2i)) 



2[ni/2] 

1/2 

j=[ni/2]+i 

2[nl/2] 

= rnA{al0)-n-^/' ^ ^H,2i). 

i=[„l/2]+l 

Hence due to r„ < 1 and |r„ — 1| < n~^/^ 

oo 2 °° 

|(-'^l,fc,-'^l,fc)o. — ■Sa,.,o| < 2 ^ |sfc,m| + —1= ^ 



i=0 



and with Lemma lD.21 



sup max 

72ee^(a,Q) A:<ni/4 
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Next we will bound {X2^k, ^2,k) „ ■ In order to do this let Tk £ On-i with entries {T^j^j 
Tk {'i/iT') Si J- Further we define Tk € M„_i 



Tk 



{AiTkf for i=j-l, 
(AjTkf for i=j + l, 
otherwise. 



Note the relation 

Gov {X2,k) = TkKTk = l/2TiK + l/2KTi + 1/2?^. 
Using Lemma |D.3 yields 

{X2,k,X2,k), = E {XlkTkDJnDX2,k) = tr {DJnDTkKTk) 

= ^ tr {JnDTlKD) + ^ tr {JuDKTId) + ^ tr {jnDTkD 
1 



and further 



tr {KJnDTlD) + - tr [JnDTkD 



2[nl/2] 



tr {AJnDTiD) = ^ {A (rf, O) - A (r|, 2i)) 

i=[ni/2]+l 



j=[ni/2]+l 



,1/2 



Because max.^^^i/2j_^^ ^j^i/aj Aj = A2[„i/2-| < ^T^'^n \ it holds 



2[nl/2] 
i=[nl/2l+i 



2[nl/2] 

j_|'„i/2j^]^ ij=±2i mod 2n, g>0 

oo oo 



i=0 



i=0 



Therefore, (A. 14) can be written as 

2[nl/2] 

tr {KJnDTlD) - tkfin^'^ Yl 

i=rni/2l + i 



(A.ll) 



(A.12) 



(A.13) 



n-- ^^A{rl2i). (A. 14) 

j=[nl/2l+l 



This gives by Lemma D.7 and Lemma D.2 



sup max 



tr {KJnDTlD) - ^tfe,o = O {n-'l^) 
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Recall that tr ( J„) = O (n). It follows 



tr JnDTkD 



< tr (Jn) max 



DTkD 



< 4tr (J„)max(AjrA,.)' 



So, 



and therefore 



sup max 



sup max tr ( JnDTf^D ) = O (ra(/)„J 



fc<ni/4 



O ( ^ „I2 



We bound the remaining terms of (A. 10). Note 



{Zi^k,i, Zi,k,i), = tr {DJnD Gov {Z^,k,i)) < Ai (Gov tr (J„) < 2(/) 

implying 

sup max {Zi^k,i, 2'i,fc,i)^ = O ((/>.^) . 

In order to bound (^i,fc,2) '^i,fc,2)o- define 

■(iAi) + l 



n 



i ,jr = l,...,?l— 1 



and ASfc G ©n-i by 



We obtain 



and hence 



(^i,fc,2, ^i,fc,2>, = tr {DJnD (ASfc) L (AS^)) < 2n^/^<Pl 



sup max (^i,fc,2, -^i,fc,2)„ = O {n^^'^4>'i 

(72Ge^(a,Q)fc<ni/4 V 



Similarly to (A. 16), (^2,fc) ■^2,fc)o- ^ '^^n'^ and thus 



sup max {Z2,k, Z2,k)^ = O ((/)„n) . 
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Note that 



for j < i 

n^^ f"^ (i/n) a [i/n) (Aja) for j>i 



Hence by Proposition D.l, we obtain 



sup max \{Xi^k,Zi^k,2)J = O (n^^'^logn (pr. 



Applying the CS-inequahty gives 



sup max I (Xi^fe, Zi,fc,i)^| = O , 

(T2Ge6(Q,Q)fe<n^/'' 



Using Proposition C.l| this yields (A.6) and ( |A.8 ). In order to give an upper bound for the 
variance of s^^o,/ T^ote 



2^4 



Var {sk,o,i) < 2 Var {VliDJnDVk,i) + 2^ Var (4,o,0 • 



Furthermore we have using ( A.3[ ) and Lemma D.3 



VI 



VliDJ„DVk,i = i'Clk,iDJnDClj,^ii + 2eClk,iDJnDC2,ke + e'Clj,DJnDC2,ke 
< 2eClk,iDJnDClkA + 2e'Cl^DJnDC2,ke. 



Hence 



Var {VliDJnDVu,i) < 8 Var (e*C^,,D J„Z)Ci,fc,ze) + 8 Var {e'C{^,DJnDC2,ke) . 



Finally, we bound Var (^^*C* ^ J„DCi^fc^/^j and Yai ye^C^ j^DJnDC2,k^) in two steps, 
which will be denoted by (a) and (6). 



(a) By Lemma D.4 (iii), we have 

Var {eClk^iDJnDCi,k,iC) = 2 J^'l? Gov (Xi,^ + Zi,k,i) DJ^/^ 



< 8 

< 16 



jy^D (Gov + Gov DJ^/^ 



jy^Z) Gov (Xi,fc)Z)jV2 



+ 16 



Jll^D Cow {Z^^k,i)DJll^ 



(A.20) 



Firstly, 



Jl'^D Gov < A? (Gov tr (j^) < 4n-i/2^4 ^ 



jy^i) Gov (Zi,fc,2) I^^/' ^ < A? (Z) J„D) tr (Gov (^1,^,2)') < 4n(/.^ ||L||^ < Ar?<j)t 



and also 



Gov < xl (Gov tr ( J^) < 



'1/2 



Therefore, 



sup max Var (e*Cj ^ iI?J„DCi,fc,;e) = O (n-V2) . 



(b) Next, we see with the same arguments as in (A.20) 

Var {e'ClkDJnDC2,ke) < (2 + Gum4 (e)) pl^^D Gov (X2,fc + Z2,k) DJ^^^ 

< 8 (2 + Gum4 (e)) J^^^ Gov (Xa,^) Z^J^^ 

+ 8 (2 + Gum4 (e)) 11 J^^Z? Gov (Z2,fc) Z^J^^ 

We obtain 

J'J^D Gov (Z2,fc) Z) ' < 4^4 (j2) ^ 4,^^n3/2. 



From (A. 11) follows now 



2 3 



2 3 



4/'dcov{x,,,)d4/' jV2z)r|z^i)ji/2 ^ + ^ jy2z)r,z)jy2 

Let In-i be the n — 1 x n — 1 identity matrix. Due to AJ„A < ^n^^ln^/^]^^^'^ ^® have 



Jll^DTlKDJl/^ ^ = tr ( Ji/2^rf AJ„AT|Z) 

<^2[nV.]-'/'tr(jy2Z)T,^Z^jy2) 
2 

2[ni/2] 



Also 



and therefore 



Jl'^DfuDJl''' < \l ( J„) 



< 2n 



214 



sup max Var (e*C^ ^DJnDC2 ^e) = O fn"^/^ + n^n) • 

• -■ A:<nl/4 ' ' ' ^ V / 



Gombining (a) and (b) gives 



sup max Var {V^ iDJ^DYk^i) = O (n^^/^ _^ ^2^4 \ ^ 



so (A. 7) and (A. 9) follow with Lemma G.l and Propositon G.l 



□ 
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Proof of Theorem \^ We decompose 

MISE {&%) = [ Bias^ {^%{t)) dt+ [ Var {a%{t)) dt. 
Jo Jo 

We have that (T^(t) = "^"^q {ipk^'^'^) i^k (t) , where ( . , . ) denotes the standard scalar 
product on L^[0, 1]. Let := E(sfe,o,«) - Sk,o- Then for i > 1, E (2sj,o,i - 2so,o,«) = 

2^/2 (^.,(72) + 2r]i^n,i - 2%,n,/- Hencc 

/ Bias^ = ??o,n,/ + 2^(7?i,„,z-7yo,n,/)'+ ^ (V'i,^^')' 

■^0 i=l i=iV+l 

and we obtain 

N 

vln,i + 2 J] - r]o,n,if < (8iV + 1) max 7?f„ 

* — U,...,iV 

1=1 

Because cr^ G 0^ (a, Q) it holds 

oo ^ oo 

E (^^' ^ 7]^TT^ E (V'- ^ (iV + 1)-- . (A.21) 

i=N+l ^-^^ 1=1 

Therefore, 

sup / Bias^ {^^(t)) dt 

(T2e0b(a,Q), ^2g0b(^ Jo 

O ( sup max r/^^ ^ + iV-^" 



Assume model (1.1), then by Lemma A.l 



sup / Bias^ (o-2(t)) 

O (^iVn"i/2 _^ iVn2-2/3 + iVn^"2" + AT 



f72ee6(Q,(3),T2ee6(/3,Q) 



and for model (1.2), 



sup / Bias^ (<5-^(t)) dt 

<72e0^(a,Q), r2e0^(/3,C3) -^0 
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For the variance note 

N 



/ Var (a^ (t)) dt = Var (so,o,0 + 2 5Z (^^^'O-' " ^^O; 

iV 

< (4iV + 1) Var (so,o,0 + 4 ^ Var (s,,, 



i,0,/J 



Hence 



sup / Var (a^ (t)) 



O I sup max Var {skfl^i) 

<T2e0^(a,Q),r2Ge5(/3,Q)°^'=^^ 



Using Lemma A.l yields the result. □ 



Appendix B Sobolev s-ellipsoids 

In this chapter we will shortly discuss the function space introduced in Section [3] and 
provide a theorem needed for the lower bound. First recall the classical definition of Sobolev 
ellipsoids (cf. Proposition 1.14 in [25j). 



Definition B.l. Define 



j", for even j, 

U - 1)" . for odd j 



Let {ipj}JLi, (piix) := 1, 4'2j{x) := \/2 cos (27rj3;) , (j)2j+i{x) := \/2 sin (27rjx) denote the 
trigonometric basis on [0, 1]. Then we call the function space 

{00 00 
/ G L^[0, 1] : 3 {9r.)Zi , s. t. fix) = ^ 6^^^ (x) , ^ aljf < C 
1=1 i=l J 

a Sobolev ellipsoid. 

Interesting characterizations arise if we put Sobolev s-ellipsoids into relation with Sobolev 
ellipsoids: 

Remark B.l. Let S be the class of all symmetric functions f £ L'^[0, 1] such that f{x) = 
/(I — x), Vx G [0, 1]. Further let Q{a,C) be a Sobolev ellipsoid. Then a function belongs 
to 0s(a, C) nS if and only if it belongs to Q{a, C) n S. 
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Let 



W{a, C) ■.=W{a, C, [0, 1]) := {/ G l2[0, 1] : /W(0) = = 

for I odd, ^ < «> ^ (^f^"\x)ydx < C 



(B.l) 



For positive integer values of a, we have the fohowing equivalence. 

Theorem B.l. Assume a G {1, 2, . . .}, C > 0. Let C = 27r^"C. Then a function is in 
yV{a, C) if and only if it is in Gs(a, C). 

Proof. First we show that if a function / G 'W{a,C) then also / G 0s(q;, C). Let / be 
defined on [—1, 1] by 



fix) :-- 



f{x) for X G [0, 1] 

f{-x) for xG[-1,0] 



Note that / is an a-times differentiable function, /*^^) is even if I is even and /*^^^ is odd if I 
is odd. Let 

!\f^^\x)dx for A; = 

i-i f^''\x) cos{kTrx)dx for A; > l,j even • 
J^i f^-'\x) sm{kTrx)dx for k> l,j odd 

It holds for j > 1 

soU) = P\x)dx = - = 0. 

Hence we have the Parseval type equality 



/ dx = -Y,sl{a). 

''^ k=i 



(B.2) 



Further for k > 1, j even, it follows by partial integration 



Skij) = J f^^\x) cos{k'Kx)dx 

1 ~ ■ 

= f^^~^\x) cos{k7Tx) + kir f^^~^\x) sm{kTrx)dx = kirsk^j — 1) 
-1 J-i 



and ioT k > 1 and j odd 

Sk{j) = / f^-'\x) sm{k'iTx)dx 



1 /"^ ~ • 

f^-'~^\x) sin{kTrx) — kn f^^~^\x) cos{kTTx)dx = —kirskij — 1) 
-1 7-1 
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With Ok = f{x)cos{k7rx)dx it follows for A: > 1 sl{a) = fc^^yr^^s^ = 4P"7r2"02^ Com- 



bining this result with (B.2) yields 



•^0 k=i 



and hence proves the first part of the theorem. The other direction follows in a straightfor- 
ward way by differentiation and is thus omitted. □ 



Supplementary Material 

Supplement: Proofs for upper bound of and further technicalities 

(http://www.stochastik.math.uni-goettingen.de/munk) In the supplementary material we 
provide a proof for Theorem [T] and summarize results from linear algebra and matrix theory 
needed for the proofs. 
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3 




Figure 1: Plots 1 and 2 display paths of sBM and tBM corresponding to a(t) = (1/2 — t)_^_ 
(Plot 3). Analogously, Plots 4 and 5 show paths of sBM and tBM with a {t) = 1+1(1/2.1] (0 
(Plot 6). For Plots 1 and 2 as well as Plots 4 and 5 we took the same realization {Wt)i-^^Q 
of the underlying Brownian motion. The first two plots show the different scaling behavior: 
sBM= and tBM= J^^^ a (s) dWg for t > 1/2. On the other hand we see by Plots 4 and 5 
that a jump induces a random shift, i.e. sBM=tBM for t<l/2 and tBM+Wi/2 =sBM for 
t > 1/2. 





0.2 0.4 o.e 0.8 




0.2 0.4 O.e 0.8 1 




TV (0,1), i.i-d., r = 0.1, a{t) 



Figure 2: n = 25000 data points from model (1.1), e^^ 

1 /2 

(2 + cos (27rt)) ' . Plot 1 shows the data. Additionally to the data, we plotted the path 
of the tBM in Plot 2. The reconstruction of and (dashed lines) as well as the true 
function (solid lines) are given in Plot 3 and 4, respectively. The threshold parameters were 
selected as A^* = 1 for estimation of and N* = 3 for estimation of cr^. 
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0.2 0.4 O.e 0.8 



Figure 3: (Heavy-tailed microstructure noise) As Figure [2] but instead of Gaussian errors 
we assumed that the noise follows a normalized Student's t-distribution with 3 degrees 
of freedom. We observe that performance of and o"^ is quite robust to heavy-tailed 
noise. The threshold parameters N* were selected as 1 and 3 for estimation of and o"^, 
respectively. 
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0.2 0.4 o.e 0.8 1 



0.2 0.4 O.e 0.8 1 




0.2 0.4 O.e 0.8 




0.2 0.4 O.e 



but we chose a(t) = 3 Wt , where ( Wt ] 
_ V / tg[o 1 1 

denotes a Brownian motion independent of the noise and the Brownian motion in (1.1). 



Figure 4: (Low-smoothness) As Figure 



The estimator returns a smoothed version of the path. The threshold parameters A^* were 
selected as 1 and 17 for estimation of and cr^, respectively. 
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Figure 5: (Jump function) As Figure [2] but we chose a{t) = 1 + I( 1/2,1] (*) • The Gibbs 
phenomenon is clearly visible. The threshold parameters N* were selected as 1 and 10 for 
estimation of and o"^, respectively. 
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Appendix C Convergence Rate of 
C.l Preliminary Results and Notation 

First we recall some notation. Let ak{i/n) := a{i/n)fk{i/n) and Tk{i/n) := T{i/n)fk{i/n). 
Let throughout the following for the Sobolev s-ellipsoids in Definition [l] for cr^ the constants 
being / = <Tmin and u = (Tmax and for r^, / = Tmin) u = Tmax- We define Kn ■= n^/^/logn 



Axel Munk* and Johannes Schmidt- Hieber 



and 



'n 



sup 



max 

i=l,...,n— 1 



sup 

^e[i/n,i+l/n] 
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Proof. We only prove the third equaUty the other two can be deduced similarly. Note that 
for r2 E e''(/3,g), 



Tk id) - Tk 



n 



< V2 



r [ii) - r 



n 



in] 



< 



1 



V27 



+ rUlki^l (2n) . 



n 



Taking supremum and applying Lemma D.8 gives the result. 



□ 



C.2 Proofs for Estimation of 



Lemma C.l. Let ikfl be defined as in {4-1)- Further assume a,P > 1/2, Q,Q > 0, 



< o"min < CTmax < CO, < Tmin < Tmax < CO and /c = G N. Assume model (1.1). Then 

(C.l) 

max Var iih o) = O (ri'^\ . 



sup max |e4,o - tkfl\ = O (n^/^^^ log^/^ n) + o (n~^^'^) 

72e0|(a,Q),T2G0|(/3,Q)''-'^" ' ' / V / 



sup 



If further e is n-variate standard normal, then 



max 



sup 

cr2ee^(a,Q), r2ee^(/3,Q) 



Var (4,o) 



n 



T^{x)dx 



n 



1/2 



= O (n ^ log ^ n) , 
(4,0 - tk,o) A (^0, 2 4{x)dx^ for 13 > I, k < Kr, 



Assume model (1.2). Then it holds 



sup max |Etfc^o ~ *fc,o| 



O f ni/2-/3 logi/2 n + ni-2a ^^^2 \ ^ ^ / -1/2 



sup m<^ Var (4,o) = O (n^ log^ n + n ^) . 



a2ee^(a,Q), r2ee^(/3,Q)' 

If further e is n-variate standard normal, then 



max 



sup 

<T2ee^(a,Q),T2e0^(/3,Q) ''-^'^ 



Var (4,o) 



n 



T^{x)dx 



n 



1/2 



= O log^ n + n-^ log"^ n) , 

(4,0 - tkfl) A (^0, 2 ^ T|(x)dx^ /or a > 3/4, /3 > 1, A; < 



(C.2) 



(C.3) 
(C.4) 



(C.5) 
(C.6) 



(C.7) 
(C.8) 
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Proof. Again we work with the generahzed estimators as introduced in Section |A.1[ As in 



the proof of Lemma A.l|we introduce for two centered random vectors P and Q a semi-inner 



product defined by {P,Q)^ := E {P^DJID^Q) and obtain 

+ 2 Zi^k,l)r + 2 {X2,k, Z2,k)r ■ (C.9) 



First we bound {X2^k, ^2,fc)^; which will turn out to be the leading term. Similar to (A. 13) 
we have 

{X2,k,X2,k)^ = tr [KJlD'TlD] + ^ tr [r^D'fkD) , 

and due to 

tr (j;) = (log n) (C.IO) 



the same argument as for (A. 15) gives 



sup max tr (j^DTfcZ)) = O (0^ log'^l • 



Hence this is a negligible term. Using Lemma D.l 



m 



n-l 

tr {AJ^D'TiD) ={n- n/logny' {A (r|,0) - A (T|,2i)) 

i=[n/ log n] 

n-l 

= TnA (r|, 0) - (n - n/ logn)"^ ^ A {rl 2i) , 

i=[n/ log n] 

where = {n — [n/logn]) / {n — n/logn) . Note 1 — f„ < 1/ (n — n/logn) . By Lemma 

EJ 



sup max |tr (AJ^^D^T^D) - tk. 



k<K, 







< sup max (1 - r„) |tfc,o| + |ife,m| + 2 (n - n/logn) ^V'lifc.jl 

oo 

<2Cp,Qn^l'^-^ + Q{n-n/\ogn)-^ sup ^ |to,i| = O (n~^ + n^/^"^) . 

r2e06(/3,O) i=0 



This shows that 



sup max (X2,fe,X2,fc)^-to,fc =0 n + 0„ -^^ logn + n^ 
r2eG^(/3,0)''-^" ^ 
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The remaining part of the proofs of (C.5) and (C.l) is concerned with bounding the other 



expressions in (C.9). Applying Lemma |D-3| we obtain 

1 



implying 



= - tr [DJID'T.1) < 2a^ax- tr {J^ 
^ n n 



sup max {Xi^k, Xi.k)^ = O {n ^ logn) . 



We obtain with Lemma D.6 in the same way as in ( A.16[ ), (A. 18) and ( |A.19 ) 



sup max {Zi^k,i,Zi^k,i)r = O {n Mogn 0^) 

sup max {Zi^k,2, ^i,fc,2)^ = O (log^ n (pl) , 
f72e0b(Q,Q) k<K„ 



sup max {Z2^k,Z2,k)r = O (4'ni/2^ogn 
From the Cauchy-Schwarz inequality follows that 

< {Xi^k,Xi^k)j. + {Zi^k,i, ^i.fc,;),- ) 



\{X2,k, Z2^k)T-\ - {^2,k,X2,k)l-^'^ {Z2,k, ^2 



This yields 



sup 



max |-E4,o - tk,o\ 

, k<K„ 



,1/2 



T2ee^(a,Q), r260|(/3,Q) 

O (n^^ log n + + (j)n,i/2 log^^^ n 



O ( n Mog n + + 0^,1/2 log^^^ ^^ + log 



n 



for Z = 1, 
for I = 2, 



and therefore (C.5) and (C.l ) holds by Proposition C.l In order to calculate the covariance 



we use the decomposition (A.3). We have 

ik,o,i = i^C\,^iDJ'^DCi^k,ii + '^i^C{,^iDJ^DC2,k(- + (^C\^^DJ'^DC2^k(^- 
Using the CS-inequality repeatedly, we can write 

|Var (4,0,/) - Var {e'ClkDJlDC2,ke) \ 

< (VarV2 (^*C7*,,,,Z)j;Z)Ci,fe,ie) +2Vari/2 J^DCa.fce)) ' 

+ (Vari/2 {eClk,iDJlDCi^k,iei +2\a?'^ {eCik,iDJlDC2,ke 
•2VarV2 {e'Cl^,DJlDC2,ke) 



(C.ll) 
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We subdivide the remaining part of the proofs of (C.6) and (C.2) into three steps (a), 
(b) and (c), where we calculate Var (e'^C^ j^DJ^DC2,ke^ , Var ^ iDJ^DCi^k,iij and 

Var (^^C{ f^ iDJ'^DC2,k^ , respectively. 

(a) Let TrSq(^) := YJi=i ^li for A G M„. Then by Lemma 



D.5 



it follows 



Var {e'ClkDj;,DC2,ke) 

= 2 \\ClkDj;,DC2,k\\l + Cum4 (e) TVSq (C|fcL»j;^L»C2,fe) 



< (2 + Cum4 (e)) ( J^)^/' D Gov {X2,k + ^2,fc) D ( 



where equality holds if Cum4 (e) = 0. By Proposition D.2 we see that 

YaT{e'Cij,Dj;,DC2,ke) - - [' 4{x)dx 

n Jo 

= O (Cum4 (e) n^^ + n^^ log^"*^ n) . 



sup max 



(b) In this part of the proof we will bound Var iqCl ^ iDJ^DCi^k,i^ ) • Similar to part (a) 



in the proof of Lemma A.l it holds 



Ys.T{eCl,^iDj;,DCi,k,i^) < 16A?(J^) ||Cov(Xi,fc)||^ + ||Cov(Zi,fc, 



< 



1^2 



/ = 1, 



28n-2 log^ n (n- + "^^1^) , 1 = 2, 



where we used Lemma D.6 in the second inequality. Hence we get 



O (n^'^ log^ n) 1 = 1 

sup max\s.r{eCl,^iDJ:DC,,k,i^) = { ' (C.12) 

a^eeb{a,Q)''<Kr. I O (log^n ((/.^ + , 1 = 2. 



(c) Using Lemma D.5 (ii) 



YB.T{eClk,iDJ:,DC2,ke) < ^Vari/2 J^Z^Ci,^,,^) \\CI,DJ:DC2„ 



and hence 



sup max Var {i^C{^f, iDJlDC2,ke) 



0(n-2log2n), ^ = 1, 

0(log2n((/>2 +n-3/2))0(n-V2), / = 2. 
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Combining (a), (b) and (c) in (C.ll) yields 



sup max 

-x^ee^KQ), r2ee|(/3,Q) ^^^^ 



Var (4,o) 



n 



Q ( Cum4 (e) ^ 



1 



n 



nlogn 



+ 



O {(jjnlog^ n + (/)nn 1 = 1, 



0, 



(C.13) 



and hence (C.2), (C.3), (C.6) and (C.7) follow using Proposition C.l 



Finally we will show the asymptotic normality (C.8) and (C.4). Because of the decom^ 



position (A.3), we have 



As proved above n^'^ {i^C\ ^^pj^DCi^k,ii + 2i^C\ ^^ ^DJ^DC2,ke) ^ for /3 > 1, a > 3/4 
if / = 1 and /3 > 1 if / = 2. Hence by Slutzky's Lemma it suffices to show that 



In order to apply Theorem D.l it remains to show 

n^l^Xi {ClkDJlDC2,k) ^ 0. 
Using Corollary |D.1[ we see that 

n^/^Ai {Ci^^DJlDC2,k) < 4n-i/2 iog2 (Cov {X2,k + ^2,fc)) 

<8n~^/^log^n sup T|(t) max Aj + Bn""^''^ log^ = o (1) 

te[o,i] » 

which yields the last statement of the lemma. 

Proof of Theorem [7J The proof is close to the one of Theorem [2] We obtain 
sup / Bias (f^(t)) dt 

= 0\ N sup max |E (4,o,z) - tfc,of + A^"^^ 

sup / Var (f^(t)) (it 

cr2ee6(Q,Q), r2ee^(^,Q) -^o 

O I A sup max Var [ikQij 

<726e^(a,Q), r2ee^(/3,Q)°^'=^^ 



□ 



□ 
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Appendix D Technical Results 

Proposition D.l. Let A € M„_i. Then 

tr (JnDAD) < (^n + Sn^/^ + Sn^/^ (1 + logn)) max (A) 

Proof. Write A = {aij)^-^^ . Note that 

2 . / ip-K\ . / gjvrN 



p,g=l 



n y \ n J 



For i = j we have further 

^ n— 1 



COS 



n-l 



P,(J=1 

In order to bound the r.h.s. we need bounds for 



E 



cos r 



^7^ 



n 



< 



Dir, 



i=[ni/2] + l 

sin ((2 +1/2) rvr/ 



[„i/2] (rvr/n) - Dirj„i/2j (rvr/n) 



< 



< 



2 sin (rvr/ (2n)) 



1 



+ 



sm 



(([ni/2] +1/2) r^/n) 



2 sin (r7r/(2n)) 



sin (ryr/ (2n)) 

Let Bi :={!,..., n} and := {n + 1, . . . , 2n - 2}. Then 

for r = 0, 



2[nl/2] 



E 



cos r 



j=[ni/2l+i 



ITT 



n 



< < 



n 

4n/r for r £ Bi, 

nj (2n — r) for r E i?2- 

Therefore, we can bound the first term of the r.h.s. of ( |D.l ) by 



i=[ni/2]+i P,g=l 



^ n— 1 



p,q=l 



^ COS ( (p - 

i=[ni/2]+l 



< n max |a. 

p,(;=l,...,n,— 1 



n-l 



3/2 



4n 
1 Q-P 



+ 2 E 



(D.l) 
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and the second term by 



["^ ' i n-1 , 

ni/2]+l P,q=l 



( 



< n 



-1 



max a- 



Due to 



p,g=l,...,ri— 1 

< 5n max |a 

p,g=l,...,n-l 
n-1 



n— 1 . n—l 

+ > 

TJ -I- n 



n 



p + q 2n - + q) 



< n - < n (1 + log 



n) 



P,9=l 



q-p 



r=l 



and 



2[ni/2] 

i=[„l/2]+l 

< (n + 5n^/^ + Sn^/^ (1 + log 



n 



max I a 

p,g=l,...,n— 1 



we obtain the result. 
Proposition D.2. It holds 



sup max 

r2ee4/3,Q) A;<ni/2 



2 2 



□ 



= O log~^ n) . 

Proo/. We obtain with ( |A.12[ ) Gov (Xg.fc + ^2,^) = 1/2T^K + 1/2KT^ + Sk, where S"/, := 
l/2Tfc+Cov (X2,fc, ■^2,fc)+Cov (.^2,fc, -'^2,fc)+Cov {Z2^k) ■ Application of the triangle inequality 
gives 



{rj'^'D{TiK + KTi)D{j;, 



.1/2 



m^/^DSuD{Jl 



,1/2 



< 



< 



1 



{rS'^ D Gov {X2,k + Z2,k) D {Jlf^ 
{Jlf'^DiTlK + KTl)D{rS'^ 



+ 



{Jlf'^DSuD{Jl 



,1/2 



(D.2) 



Note that because of Lemma |D.4 (iii) it holds 



tr ({{Jlf'^ DTiKD {rj'/'Y^ < I \\{J:f' D {tIk + KTI) D (J- 



1/2 



< 



,1/2 



(D.3) 
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Now we will bound 



tv[[{rS'^DTlKD{Jl 



,1/2 



tr 

n-l 



[{JlKf/^DTlD{hrJ'^ 
Y,>^j{DTlDKJl) 



i=l 



from below. We obtain with Lemma lD.31 

An-[n/logn] (AJ^) \n/\ozn]-l+i {DT^D) , i<n - [n/logu] , 



A, {DTlDKJl) > 



0, 



i > n — [n/ logn] 



Denote by Tf^^^i) the i-th largest component of the vector 

(rfc(l/n),...,Tfc(l-l/n)). 

Then 

2^ 



tr(((j;)i/2 DTlKD{Jl 



n-l 



Y,>?i{DTlDKJl) 



i=l 



n—[n/ log n] 

> (™-^/log^)~^T-^,([n/logn]-l+i) 
j=l 

n-l 



> (n - n/logn) ^ 



j=i 



n 



n 



'logn 



[n — nj logn)' 



(D.4) 



Next we will derive an upper bound for the r.h.s. of (D.3 ). Let analogously to the Definition 
(A. 11) Tfc be a tridiagonal matrix with entries 



Note that maxj | AjT|| < 2t, 



1/2 7 
rmax<Pn,l/2 



for i=j-l, 
for i = j + 1, 
) otherwise. 

. It is easy to check that TIKTI = \I2T^K^\I2KT^^ 
-1 A-l 



1/2Tk holds. Clearly, Jn 1^ {n — n/ logn) A , and therefore we have for the upper bound 
in dDlsl) 



{Jlf'^DTlKD{rJl^ <(n-n/logn)-i {rj^^ DTIKDA-^' 

F 

<{n-n/ logn)-i tr {[Jlf^ DTIKTID {Jlf'^) 

< {n-n/\ogny^tT {T^KDJJ^D) + ^ {n - n/ logn)' ^ tr {DTkDj;,) 

< (n — n/logn)~^tr (T^) + 2 (n — n/logn)^^ max |Tfc|..tr(J^ 

ij=l,...,n— 1 ' 
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where we used in the last inequaUty an argument as for (A.15). Combining this with (D.4) 
and Proposition C.l| yields 

-I 



sup max 



{rS"D{TlK + KTl)D{Jlfl^ 



2 2 

F n 



0(n Vn,i/2+"' ^logn.-^^ 



",1/2 



+ 71^^ log n] . 



Now we will bound the remainder term in (D.2). Using Lemma D.6 gives 

2 
F 



{Jlf/^DSkD{Jl 



,1/2 



< >?im\\Sk\\i 



< 16n ^ log^ n 



Tk 



+ 4||Cov(Z2,fc)||^ 



+S\\Cow{X2,k,Z2,k)\\l 



Because Gov {X2^ki Z2,k) is tridiagonal it holds with Lemma D.4 (i) 

n-l 2 

IICov {X2,k, Z2,k)\\l = Yl (Cov (X2,fc, ^2,fc),,,) < 8nr„ 



'^",1/2 



and therefore 



1/2 



< 16n log n ( 2n(/)„^i/2 + 16n(/)„ ;^/2 + 64n(/)„^/2'^, 



",1/2 



Vl/2' 



This leads to 



(D.5) 



sup max 



{j:f^DSkD{Jl) 



1/2 



O (n-^log^n-^2 



n,l/2 



and with (D.2) and ( |D.5 ) completes the proof 



Lemma D.l. Let Sk,p and tk^p as defined in (A. 4)- Then it holds 



□ 



1 



1 



1 



1 



1 



1 



n-l 



r=l 



1 
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(in) Let as defined in (A. 5). Then 



DElD)^^^ = A {ali -j)-A {ali + j] 



Remark D.l. In (Hi), for \i — j\ <^ i + j, the r.h.s. behaves like Sk^i-j- In the same way 
we obtain the equivalent result if we replace by t"^ . 

Proof, (ii) Note that we can write 



Sk,o + 2 ^ Sk,q COS (gvrr/ 

9=1 



n) 



and hence it holds 

n-l 



n ^ 



n 



cos 



^ n— 1 

-Sfc.oVlcos 



l'prTT\ 
\ n J n 



n-l 



r=l q=l r=l 

Let ^{A} denote the indicator function on the set A. We have the identities 

n-l 



/pr'K\ 

cos 

V n / 



) = nI{p=o mod 2n} " ^ + ("l)^) 



and 



n-l 



COS 



r=l 

From this it fohows 

n-l 



n-l 

E 

r=l 



COS 



(g — p) vrr 



n-l 



+ E 



cos 



r=l 



(q' + p) vrr 



n 



-E 



0" 



r=l 



n 



/prir 

cos , 

V n / 



1 

n 



^ oo 

i(i + (-if).,,o-E^M(i + (-ir') 



9=1 



which yields the result. 

(iii) This follows by applying (ii) to 

„ n-l 



«j n 

_ 1 
n 



E 

r=l 
n— 1 

E 

r=l 



n 



n 



sm 



cos 



sm 

n J 



n 



rjTT 
n 



^ n— 1 



n 



r=l 



cos 



(i + j) riT 
n 



□ 
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The next Lemma gives a bound of the absolute values of Fourier coefficients of cr^ 
in Sobolev s-ellipsoids. In particular the result shows that the Fourier series is absolute 
summable. 



Lemma D.2. Let st^p he as defined in (A. 4)- Assume k < cn"' , where < c < 1 is a 



constant and either 7 > 0, q > 1/2 or /c = 0, 7 = and a > 1/2. Then it holds for n large 
enough 



where C^^a,Q,c is independent of n. 



sup ^ ^ l^fc^ml ^ C^,a,< 



7(l/2-«) 



Proof. Consider the case 7 > 0,a > 1/2. Using Lemma D.l (i), we see that for n large 
enough 

00 00 00 

\Sk,m\ < |sO,m| = \so,m\ I\m>\(l-c)n'y]} 

JTl=[7l'l'] 7Tt=[(l-c)nT] m=l 

/ 00 2 \ 1/2 / 00 \ 

\i=l J \j=[(l-c)nT] / 

where we used the definition of a Sobolev s-ellipsoid in the last step. If /c = 0, 7 = and 
a > 1/2 we can argue similarly. 

□ 

In the next lemma we collect some important facts about positive semidefinite matrices 
and trace calculation. 

Lemma D.3. (i) Let A £ M„ be symmetric. A is positive semidefinite iff A = B^B for 

some B ^Mn- 

(a) If A,B are positive semidefinite matrices. Denote by Xi{A) the largest eigenvalue of 
A. Thent-[{AB) < Xi{A)tT{B). 

(Hi) LetA,B&M.n-i be positive semidefinite. Then 

Xr+s+i (AB) < Xr+i {A) Xs+i [B) 0<r + s<n-2 

K-r-s+l {AB) > Xn-r (A) Xn-s (B) 2 < r + S < U. 

(iv) Let A and B symmetric matrices. Then 

Xr+s+i iA + B)< Xr+i {A) + A,+i (B) < r + s < n - 2. 
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(v) (CS inequality for trace operator) Let A and B matrices of the same size. Then 

|tr [AB') I < tri/2 (^AA') tr^/^ [BB') . 

(vi) Let A,B matrices of the same size. Then 

A^B + B^A < A* A + B*B. 
Corollary D.l. Let A and B matrices of the same size. Then 
Ai {AB^ + BA^) < Ai [AA^) + Ai [BB*) . 



Proof. By Lemma D.3 {vi) A^B + AB^ < A^A + B^B. Applying Lemma D.3 {iv) for 



r = s = yields the result. □ 
In the following Lemma, we summarize some facts on Frobenius norms. 

Lemma D. 4. Let A eMn-i- Then 



n—l 1 

\\A\\l ■.= tr {AA') = Y,X,\AA') = Y,al^ 

i=l i,j=l 

and whenever A = A^ also \\A\\'^p = J2i=i (^)- 
(ii) It holds 

4tr{A^) < \\A + A^fp <4||A||^. 

(Hi) Let A, B be positive semidefinite matrices of the same size and < A < B . Further 
let X be another matrix of the same size. Then 

\\X^AX\\p < \\X^BX\\p. 

Proof, (i) and (ii) is well known and omitted, (iii) By assumptions it holds < X^AX < 
X^BX. Hence Xf {X^AX) < Xj [X^BX) and the result fohows. □ 

Lemma D.5. Let V = (Fi, . . . , y„) , TV = (VFi, . . . , VFm) two independent, centered 
random vectors. Let A = {o,ij).j^^ ^ E M„ and B E M„_m- Then 



(i) E (V^AV) = tr {A Gov (V)), E {V^BW) = and 
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(ii) Assume further that Vi -L Vj for all i,j = l,...,n, i ^ j and Wk -L Wi for all 
k,l = 1, . . . ,m, k ^ I and Var {Vi) = Var {Wk) = 1 for i = 1, . . . ,n and j = 1, . . . ,m. 
We set TVSq(yl) := X^^^^ af ^. Then 

n 

Var {V^AV) = Cum4 {V) ^ al + tr {A^ + AA^) 

1=1 

n 

<Cnra^{V)Y,al + 2\\A\\] 



1=1 

|2 



< (2 + Cum4(y))p||^, (D.6) 
Yar{V*BW) = \\B\\l , 
Var {V^ABW) < ■ (D.7) 

Proof. We only proof the first and the last statement in (ii). Note that 

n 

Var {V'AV) = ^ aijOki Gov {ViVj ,VkVi) . 

i,j,k,l=l 

lii = j = k = l then Gov {ViVj, VkVi) = 2 + Gum4 {V); ii i = k, j = I, i j or i = I, j = k, 
i / j then Gov {ViVj,VkVi) = I. Otherwise Gov {ViVj , VkVi) = and this gives ( [P^ . 
In order to see ( D.7| ) note that by Lemma D.3| (v) 

Var {V^ABW) = \\B^A\\p = tr ((55*) {AA^)) 



< tri/2 ({BBY) tri/2 ((AA*)' = \\BB'\\^ \\AA'\ 



F ■ 



□ 



Theorem D.l. Let ^ ~ A/'(0,/„) and A be a positive semidefinite matrix. Then 

Var-i/2 (^t^^) (^^t^^ - Ee*^0 ^ -^(0' 1) 
if and only i/ Var^^/^ (^t^^^ Xi (A) 0. 
Lemma D.6. Let n > 4. T/ien 

Al (J^) < 4n-Mog2n. 
Proof. Let r = [n/logn] and note that sin(x)~^ < 2/x for x € (0,7r/2]. Then 

and 



A-^< — <^n^ <21og^n 

yrvry vr^ \2 log 



Al (j;) = (n-n/logn)-^ A-i < - log^ n 

n 



□ 
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Lemma D.7. Let Aj be as defined in (3.2). Then it holds 



n 



i=[ni/2]+i 



Proof. Let Xi = ivr/ (2n). Note that s\v? {xi) = xf — Cf/S, where G (0,Xj). Further 

2[nl/2] 



max.^^^i/2j_^^_^^^_2[ni/2] a:i < n ^/^tt. Hence 



n 



1/2 



E£,f < n max = O (n ^ 



j=[ni/2l+i 



i=[rii/2] + l,,,.,2[ni/2] 



and thus 



n 



1/2 



2[nl/2] 

5: A. = 4nV^ 



j=[ni/2l+l 



j=[ni/2]+l 
2[ni/2] 



2„-3/2 ^ i2^Q(^-l) ^1^^ 0^^-1/2 
j=[nl/2]+l 



77r2 
3 



□ 



Lemma D.8 (Continuous Sobolev Embedding). LetC {q), q > denote the space of Holder 
continuous functions on [0, 1] equipped with the canonical norm and define 



77 : (l/2,oo) X [0,oo) ^ M, rj{a,6):=< 



a- 1/2 a e (1/2,3/2), 
1-S a = 3/2, 

1 a > 3/2. 



Suppose a > 1/2. Then for any 5 > the embedding 

l: &,{a,Q)^C{r^{a,5)) 

is continuous and in particular 

sup \\f\\c{n{a.5)) < oo- 
/ee|(a,Q) 

Proof. For a given function / : [0, 1] — t- M define / : [—1, 1] - 

fix) XG[0,1], 

fyx) ■■= { 

fi-x) xe[-i,o] 
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Let for s > 0, W^'^ [— 1, l]|p ^ denote the (fractional) Sobolev space on [— 1, 1] , where the 
domain of functions is restricted to [0, 1] equipped with the norm 



Note this is a function space on [0, 1] and W^''^ [—1, ^ W^'"^ [0, 1]. By the Sobolev 
embedding theorem (see Taylor (1996), Proposition 8.5) we have for a > 1/2 that 

is continuous and since it is linear also bounded. This yields 

< oo. 



□ 



sup \\f\\c(r,{a,S)) ^ Ml sup |Mlliy«.2[_i,i]| 
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